254 research outputs found

    Functional models and extending strategies for ecological networks

    Get PDF

    Strategies for analyzing bisulfite sequencing data

    Get PDF
    DNA methylation is one of the main epigenetic modifications in the eukaryotic genome; it has been shown to play a role in cell-type specific regulation of gene expression, and therefore cell-type identity. Bisulfite sequencing is the gold-standard for measuring methylation over the genomes of interest. Here, we review several techniques used for the analysis of high-throughput bisulfite sequencing. We introduce specialized short-read alignment techniques as well as pre/post-alignment quality check methods to ensure data quality. Furthermore, we discuss subsequent analysis steps after alignment. We introduce various differential methylation methods and compare their performance using simulated and real bisulfite sequencing datasets. We also discuss the methods used to segment methylomes in order to pinpoint regulatory regions. We introduce annotation methods that can be used for further classification of regions returned by segmentation and differential methylation methods. Finally, we review software packages that implement strategies to efficiently deal with large bisulfite sequencing datasets locally and we discuss online analysis workflows that do not require any prior programming skills. The analysis strategies described in this review will guide researchers at any level to the best practices of bisulfite sequencing analysis

    Strategies for analyzing bisulfite sequencing data

    Get PDF
    DNA methylation is one of the main epigenetic modifications in the eukaryotic genome and has been shown to play a role in cell-type specific regulation of gene expression, and therefore cell-type identity. Bisulfite sequencing is the gold-standard for measuring methylation over the genomes of interest. Here, we review several techniques used for the analysis of high-throughput bisulfite sequencing. We introduce specialized short-read alignment techniques as well as pre/post-alignment quality check methods to ensure data quality. Furthermore, we discuss subsequent analysis steps after alignment. We introduce various differential methylation methods and compare their performance using simulated and real bisulfite-sequencing datasets. We also discuss the methods used to segment methylomes in order to pinpoint regulatory regions. We introduce annotation methods that can be used further classification of regions returned by segmentation or differential methylation methods. Lastly, we review software packages that implement strategies to efficiently deal with large bisulfite sequencing datasets locally and also discuss online analysis workflows that do not require any prior programming skills. The analysis strategies described in this review will guide researchers at any level to the best practices of bisulfite sequencing analysis

    Maximum Entropy Reconstructions of Dynamic Signaling Networks from Quantitative Proteomics Data

    Get PDF
    Advances in mass spectrometry among other technologies have allowed for quantitative, reproducible, proteome-wide measurements of levels of phosphorylation as signals propagate through complex networks in response to external stimuli under different conditions. However, computational approaches to infer elements of the signaling network strictly from the quantitative aspects of proteomics data are not well established. We considered a method using the principle of maximum entropy to infer a network of interacting phosphotyrosine sites from pairwise correlations in a mass spectrometry data set and derive a phosphorylation-dependent interaction network solely from quantitative proteomics data. We first investigated the applicability of this approach by using a simulation of a model biochemical signaling network whose dynamics are governed by a large set of coupled differential equations. We found that in a simulated signaling system, the method detects interactions with significant accuracy. We then analyzed a growth factor mediated signaling network in a human mammary epithelial cell line that we inferred from mass spectrometry data and observe a biologically interpretable, small-world structure of signaling nodes, as well as a catalog of predictions regarding the interactions among previously uncharacterized phosphotyrosine sites. For example, the calculation places a recently identified tumor suppressor pathway through ARHGEF7 and Scribble, in the context of growth factor signaling. Our findings suggest that maximum entropy derived network models are an important tool for interpreting quantitative proteomics data

    Genome-Wide DNA Methylation Profiling in Early Stage I Lung Adenocarcinoma Reveals Predictive Aberrant Methylation in the Promoter Region of the Long Noncoding RNA PLUT: An Exploratory Study

    Get PDF
    Introduction: Surgical procedure is the treatment of choice in early stage I lung adenocarcinoma. However, a considerable number of patients experience recurrence within the first 2 years after complete resection. Suitable prognostic biomarkers that identify patients at high risk of recurrence (who may probably benefit from adjuvant treatment) are still not available. This study aimed at identifying methylation markers for early recurrence that may become important tools for the development of new treatment modalities. Methods: Genome-wide DNA methylation profiling was performed on 30 stage I lung adenocarcinomas, comparing 14 patients with early metastatic recurrence with 16 patients with a long-term relapse-free survival period using methylated-CpG-immunoprecipitation followed by high-throughput next-generation sequencing. The differentially methylated regions between the two subgroups were validated for their prognostic value in two independent cohorts using the MassCLEAVE assay, a high-resolution quantitative methylation analysis. Results: Unsupervised clustering of patients in the discovery cohort on the basis of differentially methylated regions identified patients with shorter relapse-free survival (hazard ratio: 2.23; 95% confidence interval: 0.66-7.53; p = 0.03). In two validation cohorts, promoter hypermethylation of the long noncoding RNA PLUT was significantly associated with shorter relapse-free survival (hazard ratio: 0.54; 95% confidence interval: 0.31-0.93; p < 0.026) and could be reported as an independent prognostic factor in the multivariate Cox regression analysis. Conclusions: Promoter hypermethylation of the long noncoding RNA PLUT is predictive in patients with early stage I adenocarcinoma at high risk for early recurrence. Further studies are needed to validate its role in carcinogenesis and its use as a biomarker to facilitate patient selection and risk stratification

    Chemical Basis of Metabolic Network Organization

    Get PDF
    Although the metabolic networks of the three domains of life consist of different constituents and metabolic pathways, they exhibit the same scale-free organization. This phenomenon has been hypothetically explained by preferential attachment principle that the new-recruited metabolites attach preferentially to those that are already well connected. However, since metabolites are usually small molecules and metabolic processes are basically chemical reactions, we speculate that the metabolic network organization may have a chemical basis. In this paper, chemoinformatic analyses on metabolic networks of Kyoto Encyclopedia of Genes and Genomes (KEGG), Escherichia coli and Saccharomyces cerevisiae were performed. It was found that there exist qualitative and quantitative correlations between network topology and chemical properties of metabolites. The metabolites with larger degrees of connectivity (hubs) are of relatively stronger polarity. This suggests that metabolic networks are chemically organized to a certain extent, which was further elucidated in terms of high concentrations required by metabolic hubs to drive a variety of reactions. This finding not only provides a chemical explanation to the preferential attachment principle for metabolic network expansion, but also has important implications for metabolic network design and metabolite concentration prediction

    Construction of gene regulatory networks using biclustering and bayesian networks

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Understanding gene interactions in complex living systems can be seen as the ultimate goal of the systems biology revolution. Hence, to elucidate disease ontology fully and to reduce the cost of drug development, gene regulatory networks (GRNs) have to be constructed. During the last decade, many GRN inference algorithms based on genome-wide data have been developed to unravel the complexity of gene regulation. Time series transcriptomic data measured by genome-wide DNA microarrays are traditionally used for GRN modelling. One of the major problems with microarrays is that a dataset consists of relatively few time points with respect to the large number of genes. Dimensionality is one of the interesting problems in GRN modelling.</p> <p>Results</p> <p>In this paper, we develop a biclustering function enrichment analysis toolbox (BicAT-plus) to study the effect of biclustering in reducing data dimensions. The network generated from our system was validated via available interaction databases and was compared with previous methods. The results revealed the performance of our proposed method.</p> <p>Conclusions</p> <p>Because of the sparse nature of GRNs, the results of biclustering techniques differ significantly from those of previous methods.</p

    Signatures of Selection in Fusion Transcripts Resulting From Chromosomal Translocations in Human Cancer

    Get PDF
    BACKGROUND: The recurrence and non-random distribution of translocation breakpoints in human tumors are usually attributed to local sequence features present in the vicinity of the breakpoints. However, it has also been suggested that functional constraints might contribute to delimit the position of translocation breakpoints within the genes involved, but a quantitative analysis of such contribution has been lacking. METHODOLOGY: We have analyzed two well-known signatures of functional selection, such as reading-frame compatibility and non-random combinations of protein domains, on an extensive dataset of fusion proteins resulting from chromosomal translocations in cancer. CONCLUSIONS: Our data provide strong experimental support for the concept that the position of translocation breakpoints in the genome of cancer cells is determined, to a large extent, by the need to combine certain protein domains and to keep an intact reading frame in fusion transcripts. Additionally, the information that we have assembled affords a global view of the oncogenic mechanisms and domain architectures that are used by fusion proteins. This can be used to assess the functional impact of novel chromosomal translocations and to predict the position of breakpoints in the genes involved
    corecore